Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 4185749 |
| Missing cells | 39486179 |
| Missing cells (%) | 37.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.5 GiB |
| Average record size in memory | 1.1 KiB |
Variable types
| Numeric | 4 |
|---|---|
| DateTime | 2 |
| Text | 9 |
| Categorical | 10 |
DRIVER_LICENSE_STATUS is highly imbalanced (86.5%) | Imbalance |
VEHICLE_DAMAGE_3 is highly imbalanced (53.9%) | Imbalance |
PUBLIC_PROPERTY_DAMAGE is highly imbalanced (63.5%) | Imbalance |
STATE_REGISTRATION has 305429 (7.3%) missing values | Missing |
VEHICLE_TYPE has 237271 (5.7%) missing values | Missing |
VEHICLE_MAKE has 1881778 (45.0%) missing values | Missing |
VEHICLE_MODEL has 4134369 (98.8%) missing values | Missing |
VEHICLE_YEAR has 1901490 (45.4%) missing values | Missing |
TRAVEL_DIRECTION has 1668118 (39.9%) missing values | Missing |
VEHICLE_OCCUPANTS has 1782928 (42.6%) missing values | Missing |
DRIVER_SEX has 2221537 (53.1%) missing values | Missing |
DRIVER_LICENSE_STATUS has 2310803 (55.2%) missing values | Missing |
DRIVER_LICENSE_JURISDICTION has 2306176 (55.1%) missing values | Missing |
PRE_CRASH has 921425 (22.0%) missing values | Missing |
POINT_OF_IMPACT has 1701246 (40.6%) missing values | Missing |
VEHICLE_DAMAGE has 1725730 (41.2%) missing values | Missing |
VEHICLE_DAMAGE_1 has 2601039 (62.1%) missing values | Missing |
VEHICLE_DAMAGE_2 has 2991845 (71.5%) missing values | Missing |
VEHICLE_DAMAGE_3 has 3270248 (78.1%) missing values | Missing |
PUBLIC_PROPERTY_DAMAGE has 1528858 (36.5%) missing values | Missing |
PUBLIC_PROPERTY_DAMAGE_TYPE has 4159532 (99.4%) missing values | Missing |
CONTRIBUTING_FACTOR_1 has 148303 (3.5%) missing values | Missing |
CONTRIBUTING_FACTOR_2 has 1688054 (40.3%) missing values | Missing |
VEHICLE_YEAR is highly skewed (γ1 = 55.36312215) | Skewed |
VEHICLE_OCCUPANTS is highly skewed (γ1 = 1088.274577) | Skewed |
UNIQUE_ID has unique values | Unique |
VEHICLE_OCCUPANTS has 412632 (9.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-07 03:20:16.540189 |
|---|---|
| Analysis finished | 2024-05-07 03:22:46.485263 |
| Duration | 2 minutes and 29.95 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
UNIQUE_ID
Real number (ℝ)
UNIQUE 
| Distinct | 4185749 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16558039 |
| Minimum | 111711 |
|---|---|
| Maximum | 20645072 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 MiB |
Quantile statistics
| Minimum | 111711 |
|---|---|
| 5-th percentile | 9673053.4 |
| Q1 | 14562233 |
| median | 17550710 |
| Q3 | 19117378 |
| 95-th percentile | 20417546 |
| Maximum | 20645072 |
| Range | 20533361 |
| Interquartile range (IQR) | 4555145 |
Descriptive statistics
| Standard deviation | 3350117 |
|---|---|
| Coefficient of variation (CV) | 0.2023257 |
| Kurtosis | -0.38117072 |
| Mean | 16558039 |
| Median Absolute Deviation (MAD) | 2475734 |
| Skewness | -0.80951018 |
| Sum | 6.9307797 × 1013 |
| Variance | 1.1223284 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10385780 | 1 | < 0.1% |
| 19089622 | 1 | < 0.1% |
| 19016024 | 1 | < 0.1% |
| 17681000 | 1 | < 0.1% |
| 17620013 | 1 | < 0.1% |
| 17133151 | 1 | < 0.1% |
| 17632013 | 1 | < 0.1% |
| 17573075 | 1 | < 0.1% |
| 18705606 | 1 | < 0.1% |
| 18952545 | 1 | < 0.1% |
| Other values (4185739) | 4185739 |
| Value | Count | Frequency (%) |
| 111711 | 1 | |
| 111712 | 1 | |
| 115530 | 1 | |
| 115531 | 1 | |
| 120620 | 1 | |
| 123422 | 1 | |
| 123423 | 1 | |
| 199289 | 1 | |
| 199290 | 1 | |
| 199291 | 1 |
| Value | Count | Frequency (%) |
| 20645072 | 1 | |
| 20645071 | 1 | |
| 20645049 | 1 | |
| 20645048 | 1 | |
| 20645047 | 1 | |
| 20645040 | 1 | |
| 20645039 | 1 | |
| 20645038 | 1 | |
| 20645037 | 1 | |
| 20645036 | 1 |
COLLISION_ID
Real number (ℝ)
| Distinct | 2083567 |
|---|---|
| Distinct (%) | 49.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3181329.3 |
| Minimum | 22 |
|---|---|
| Maximum | 4722272 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 108378.4 |
| Q1 | 3163427 |
| median | 3687323 |
| Q3 | 4207061 |
| 95-th percentile | 4618128.6 |
| Maximum | 4722272 |
| Range | 4722250 |
| Interquartile range (IQR) | 1043634 |
Descriptive statistics
| Standard deviation | 1497302 |
|---|---|
| Coefficient of variation (CV) | 0.47065295 |
| Kurtosis | 0.041194458 |
| Mean | 3181329.3 |
| Median Absolute Deviation (MAD) | 521814 |
| Skewness | -1.2467723 |
| Sum | 1.3316246 × 1013 |
| Variance | 2.2419134 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4691158 | 42 | < 0.1% |
| 4539133 | 40 | < 0.1% |
| 4275782 | 25 | < 0.1% |
| 3925685 | 22 | < 0.1% |
| 4541337 | 22 | < 0.1% |
| 4324675 | 22 | < 0.1% |
| 306480 | 21 | < 0.1% |
| 4625450 | 20 | < 0.1% |
| 4578189 | 19 | < 0.1% |
| 3187017 | 19 | < 0.1% |
| Other values (2083557) | 4185497 |
| Value | Count | Frequency (%) |
| 22 | 2 | |
| 23 | 2 | |
| 24 | 2 | |
| 25 | 2 | |
| 26 | 2 | |
| 27 | 2 | |
| 28 | 2 | |
| 29 | 2 | |
| 30 | 2 | |
| 31 | 2 |
| Value | Count | Frequency (%) |
| 4722272 | 2 | |
| 4722270 | 2 | |
| 4722268 | 2 | |
| 4722265 | 1 | |
| 4722264 | 2 | |
| 4722263 | 2 | |
| 4722260 | 2 | |
| 4722259 | 1 | |
| 4722254 | 2 | |
| 4722253 | 2 |
CRASH_DATE
Date
| Distinct | 4325 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 MiB |
| Minimum | 2012-07-01 00:00:00 |
|---|---|
| Maximum | 2024-05-03 00:00:00 |
CRASH_TIME
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 MiB |
| Minimum | 2024-05-06 00:00:00 |
|---|---|
| Maximum | 2024-05-06 23:59:00 |
VEHICLE_ID
Text
| Distinct | 2656924 |
|---|---|
| Distinct (%) | 63.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 309.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 20.543244 |
| Min length | 1 |
Characters and Unicode
| Total characters | 85988861 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2656905 ? |
|---|---|
| Unique (%) | 63.5% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0553ab4d-9500-4cba-8d98-f4d7f89d5856 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 769061 | 18.4% |
| 2 | 694883 | 16.6% |
| 3 | 50530 | 1.2% |
| 4 | 10398 | 0.2% |
| 5 | 2608 | 0.1% |
| 6 | 791 | < 0.1% |
| 7 | 281 | < 0.1% |
| 8 | 130 | < 0.1% |
| 9 | 69 | < 0.1% |
| 10 | 36 | < 0.1% |
| Other values (2656914) | 2656962 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 9127904 | 10.6% |
| 4 | 6791276 | 7.9% |
| 1 | 5374630 | 6.3% |
| 2 | 5213477 | 6.1% |
| 8 | 5067349 | 5.9% |
| 9 | 5064080 | 5.9% |
| b | 4853328 | 5.6% |
| a | 4845628 | 5.6% |
| 3 | 4552258 | 5.3% |
| 5 | 4501708 | 5.2% |
| Other values (7) | 30597223 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 85988861 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 9127904 | 10.6% |
| 4 | 6791276 | 7.9% |
| 1 | 5374630 | 6.3% |
| 2 | 5213477 | 6.1% |
| 8 | 5067349 | 5.9% |
| 9 | 5064080 | 5.9% |
| b | 4853328 | 5.6% |
| a | 4845628 | 5.6% |
| 3 | 4552258 | 5.3% |
| 5 | 4501708 | 5.2% |
| Other values (7) | 30597223 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 85988861 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 9127904 | 10.6% |
| 4 | 6791276 | 7.9% |
| 1 | 5374630 | 6.3% |
| 2 | 5213477 | 6.1% |
| 8 | 5067349 | 5.9% |
| 9 | 5064080 | 5.9% |
| b | 4853328 | 5.6% |
| a | 4845628 | 5.6% |
| 3 | 4552258 | 5.3% |
| 5 | 4501708 | 5.2% |
| Other values (7) | 30597223 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 85988861 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 9127904 | 10.6% |
| 4 | 6791276 | 7.9% |
| 1 | 5374630 | 6.3% |
| 2 | 5213477 | 6.1% |
| 8 | 5067349 | 5.9% |
| 9 | 5064080 | 5.9% |
| b | 4853328 | 5.6% |
| a | 4845628 | 5.6% |
| 3 | 4552258 | 5.3% |
| 5 | 4501708 | 5.2% |
| Other values (7) | 30597223 |
MISSING 
| Distinct | 82 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 305429 |
| Missing (%) | 7.3% |
| Memory size | 227.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.9999997 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7760639 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | NY |
| 3rd row | NY |
| 4th row | NY |
| 5th row | NY |
| Value | Count | Frequency (%) |
| ny | 3232331 | |
| nj | 236709 | 6.1% |
| pa | 86926 | 2.2% |
| fl | 47820 | 1.2% |
| ct | 43817 | 1.1% |
| va | 19244 | 0.5% |
| ma | 18362 | 0.5% |
| md | 18150 | 0.5% |
| nc | 17039 | 0.4% |
| ga | 14182 | 0.4% |
| Other values (72) | 145740 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 3513550 | |
| Y | 3233546 | |
| J | 236709 | 3.1% |
| A | 160691 | 2.1% |
| P | 87944 | 1.1% |
| C | 76227 | 1.0% |
| T | 65899 | 0.8% |
| L | 62048 | 0.8% |
| M | 49564 | 0.6% |
| F | 48230 | 0.6% |
| Other values (16) | 226231 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7760639 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 3513550 | |
| Y | 3233546 | |
| J | 236709 | 3.1% |
| A | 160691 | 2.1% |
| P | 87944 | 1.1% |
| C | 76227 | 1.0% |
| T | 65899 | 0.8% |
| L | 62048 | 0.8% |
| M | 49564 | 0.6% |
| F | 48230 | 0.6% |
| Other values (16) | 226231 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7760639 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 3513550 | |
| Y | 3233546 | |
| J | 236709 | 3.1% |
| A | 160691 | 2.1% |
| P | 87944 | 1.1% |
| C | 76227 | 1.0% |
| T | 65899 | 0.8% |
| L | 62048 | 0.8% |
| M | 49564 | 0.6% |
| F | 48230 | 0.6% |
| Other values (16) | 226231 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7760639 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 3513550 | |
| Y | 3233546 | |
| J | 236709 | 3.1% |
| A | 160691 | 2.1% |
| P | 87944 | 1.1% |
| C | 76227 | 1.0% |
| T | 65899 | 0.8% |
| L | 62048 | 0.8% |
| M | 49564 | 0.6% |
| F | 48230 | 0.6% |
| Other values (16) | 226231 | 2.9% |
VEHICLE_TYPE
Text
MISSING 
| Distinct | 2725 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 237271 |
| Missing (%) | 5.7% |
| Memory size | 284.3 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 30 |
| Mean length | 16.576971 |
| Min length | 1 |
Characters and Unicode
| Total characters | 65453806 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1652 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PASSENGER VEHICLE |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | TAXI |
| 4th row | PASSENGER VEHICLE |
| 5th row | PASSENGER VEHICLE |
| Value | Count | Frequency (%) |
| vehicle | 1626128 | |
| utility | 1173683 | |
| station | 1173610 | |
| sedan | 1126807 | |
| wagon/sport | 835682 | |
| passenger | 770775 | |
| 340727 | 3.7% | |
| wagon | 338047 | 3.7% |
| sport | 337927 | 3.7% |
| truck | 177854 | 1.9% |
| Other values (1445) | 1311969 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5264731 | 8.0% | |
| S | 5055296 | 7.7% |
| t | 4247518 | 6.5% |
| i | 3601749 | 5.5% |
| E | 3408377 | 5.2% |
| e | 2990202 | 4.6% |
| a | 2975013 | 4.5% |
| n | 2840400 | 4.3% |
| o | 2668562 | 4.1% |
| T | 2162950 | 3.3% |
| Other values (65) | 30239008 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 65453806 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5264731 | 8.0% | |
| S | 5055296 | 7.7% |
| t | 4247518 | 6.5% |
| i | 3601749 | 5.5% |
| E | 3408377 | 5.2% |
| e | 2990202 | 4.6% |
| a | 2975013 | 4.5% |
| n | 2840400 | 4.3% |
| o | 2668562 | 4.1% |
| T | 2162950 | 3.3% |
| Other values (65) | 30239008 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 65453806 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5264731 | 8.0% | |
| S | 5055296 | 7.7% |
| t | 4247518 | 6.5% |
| i | 3601749 | 5.5% |
| E | 3408377 | 5.2% |
| e | 2990202 | 4.6% |
| a | 2975013 | 4.5% |
| n | 2840400 | 4.3% |
| o | 2668562 | 4.1% |
| T | 2162950 | 3.3% |
| Other values (65) | 30239008 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 65453806 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5264731 | 8.0% | |
| S | 5055296 | 7.7% |
| t | 4247518 | 6.5% |
| i | 3601749 | 5.5% |
| E | 3408377 | 5.2% |
| e | 2990202 | 4.6% |
| a | 2975013 | 4.5% |
| n | 2840400 | 4.3% |
| o | 2668562 | 4.1% |
| T | 2162950 | 3.3% |
| Other values (65) | 30239008 |
VEHICLE_MAKE
Text
MISSING 
| Distinct | 12874 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1881778 |
| Missing (%) | 45.0% |
| Memory size | 210.6 MiB |
Length
| Max length | 52 |
|---|---|
| Median length | 13 |
| Mean length | 12.693347 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29245103 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9000 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | TOYT -CAR/SUV |
|---|---|
| 2nd row | MERZ -CAR/SUV |
| 3rd row | FRHT-TRUCK/BUS |
| 4th row | FORD -CAR/SUV |
| 5th row | VOLK -CAR/SUV |
| Value | Count | Frequency (%) |
| car/suv | 2073789 | |
| toyt | 394420 | 8.9% |
| hond | 286537 | 6.5% |
| niss | 233661 | 5.3% |
| ford | 200368 | 4.5% |
| chev | 110723 | 2.5% |
| hyun | 82015 | 1.9% |
| bmw | 78101 | 1.8% |
| merz | 76151 | 1.7% |
| jeep | 75264 | 1.7% |
| Other values (6814) | 812645 | 18.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2819603 | |
| R | 2667135 | |
| U | 2560537 | 8.8% |
| C | 2525143 | 8.6% |
| A | 2311261 | 7.9% |
| V | 2260432 | 7.7% |
| - | 2214931 | 7.6% |
| / | 2204446 | 7.5% |
| 2119703 | 7.2% | |
| O | 1060624 | 3.6% |
| Other values (70) | 6501288 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29245103 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 2819603 | |
| R | 2667135 | |
| U | 2560537 | 8.8% |
| C | 2525143 | 8.6% |
| A | 2311261 | 7.9% |
| V | 2260432 | 7.7% |
| - | 2214931 | 7.6% |
| / | 2204446 | 7.5% |
| 2119703 | 7.2% | |
| O | 1060624 | 3.6% |
| Other values (70) | 6501288 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29245103 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 2819603 | |
| R | 2667135 | |
| U | 2560537 | 8.8% |
| C | 2525143 | 8.6% |
| A | 2311261 | 7.9% |
| V | 2260432 | 7.7% |
| - | 2214931 | 7.6% |
| / | 2204446 | 7.5% |
| 2119703 | 7.2% | |
| O | 1060624 | 3.6% |
| Other values (70) | 6501288 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29245103 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 2819603 | |
| R | 2667135 | |
| U | 2560537 | 8.8% |
| C | 2525143 | 8.6% |
| A | 2311261 | 7.9% |
| V | 2260432 | 7.7% |
| - | 2214931 | 7.6% |
| / | 2204446 | 7.5% |
| 2119703 | 7.2% | |
| O | 1060624 | 3.6% |
| Other values (70) | 6501288 |
VEHICLE_MODEL
Text
MISSING 
| Distinct | 2429 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 4134369 |
| Missing (%) | 98.8% |
| Memory size | 129.3 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 8 |
| Mean length | 7.5591086 |
| Min length | 1 |
Characters and Unicode
| Total characters | 388387 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1327 ? |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | TOYT 4RN |
|---|---|
| 2nd row | FORD ZZZ |
| 3rd row | TRUCK TRADE |
| 4th row | DODG CHA |
| 5th row | town and country |
| Value | Count | Frequency (%) |
| zzz | 9213 | 9.7% |
| toyt | 8644 | 9.1% |
| hond | 5999 | 6.3% |
| niss | 5220 | 5.5% |
| ford | 4930 | 5.2% |
| cam | 3092 | 3.3% |
| chev | 2681 | 2.8% |
| acc | 1899 | 2.0% |
| hyun | 1575 | 1.7% |
| alt | 1532 | 1.6% |
| Other values (1769) | 50052 |
Most occurring characters
| Value | Count | Frequency (%) |
| 43457 | 11.2% | |
| Z | 32695 | 8.4% |
| T | 27048 | 7.0% |
| O | 25820 | 6.6% |
| C | 22245 | 5.7% |
| N | 21375 | 5.5% |
| S | 18775 | 4.8% |
| A | 17553 | 4.5% |
| D | 17438 | 4.5% |
| R | 16184 | 4.2% |
| Other values (63) | 145797 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 388387 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 43457 | 11.2% | |
| Z | 32695 | 8.4% |
| T | 27048 | 7.0% |
| O | 25820 | 6.6% |
| C | 22245 | 5.7% |
| N | 21375 | 5.5% |
| S | 18775 | 4.8% |
| A | 17553 | 4.5% |
| D | 17438 | 4.5% |
| R | 16184 | 4.2% |
| Other values (63) | 145797 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 388387 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 43457 | 11.2% | |
| Z | 32695 | 8.4% |
| T | 27048 | 7.0% |
| O | 25820 | 6.6% |
| C | 22245 | 5.7% |
| N | 21375 | 5.5% |
| S | 18775 | 4.8% |
| A | 17553 | 4.5% |
| D | 17438 | 4.5% |
| R | 16184 | 4.2% |
| Other values (63) | 145797 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 388387 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 43457 | 11.2% | |
| Z | 32695 | 8.4% |
| T | 27048 | 7.0% |
| O | 25820 | 6.6% |
| C | 22245 | 5.7% |
| N | 21375 | 5.5% |
| S | 18775 | 4.8% |
| A | 17553 | 4.5% |
| D | 17438 | 4.5% |
| R | 16184 | 4.2% |
| Other values (63) | 145797 |
VEHICLE_YEAR
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 321 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1901490 |
| Missing (%) | 45.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.1493 |
| Minimum | 1000 |
|---|---|
| Maximum | 20063 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 MiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2008 |
| median | 2014 |
| Q3 | 2017 |
| 95-th percentile | 2020 |
| Maximum | 20063 |
| Range | 19063 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 148.32851 |
|---|---|
| Coefficient of variation (CV) | 0.073606707 |
| Kurtosis | 3275.7836 |
| Mean | 2015.1493 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 55.363122 |
| Sum | 4.603123 × 109 |
| Variance | 22001.346 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2016 | 221129 | 5.3% |
| 2015 | 218791 | 5.2% |
| 2017 | 200176 | 4.8% |
| 2014 | 161252 | 3.9% |
| 2013 | 137632 | 3.3% |
| 2018 | 137172 | 3.3% |
| 2012 | 109886 | 2.6% |
| 2011 | 98656 | 2.4% |
| 2019 | 96930 | 2.3% |
| 2007 | 90282 | 2.2% |
| Other values (311) | 812353 | |
| (Missing) | 1901490 |
| Value | Count | Frequency (%) |
| 1000 | 1 | < 0.1% |
| 1111 | 2 | < 0.1% |
| 1900 | 7 | |
| 1920 | 2 | < 0.1% |
| 1921 | 1 | < 0.1% |
| 1923 | 1 | < 0.1% |
| 1926 | 1 | < 0.1% |
| 1930 | 1 | < 0.1% |
| 1931 | 1 | < 0.1% |
| 1932 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 20063 | 1 | < 0.1% |
| 20015 | 2 | < 0.1% |
| 20009 | 1 | < 0.1% |
| 20003 | 1 | < 0.1% |
| 19969 | 1 | < 0.1% |
| 9999 | 728 | |
| 9972 | 1 | < 0.1% |
| 9699 | 1 | < 0.1% |
| 9019 | 1 | < 0.1% |
| 8888 | 1 | < 0.1% |
TRAVEL_DIRECTION
Categorical
MISSING 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1668118 |
| Missing (%) | 39.9% |
| Memory size | 250.2 MiB |
| West | |
|---|---|
| East | |
| North | |
| South | |
| Unknown | |
| Other values (10) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 4.8151703 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12122822 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | North |
|---|---|
| 2nd row | East |
| 3rd row | East |
| 4th row | Southwest |
| 5th row | South |
Common Values
| Value | Count | Frequency (%) |
| West | 578302 | 13.8% |
| East | 576123 | 13.8% |
| North | 575683 | 13.8% |
| South | 569823 | 13.6% |
| Unknown | 83866 | 2.0% |
| Northeast | 35085 | 0.8% |
| Southeast | 33374 | 0.8% |
| Southwest | 32658 | 0.8% |
| Northwest | 30970 | 0.7% |
| - | 1003 | < 0.1% |
| Other values (5) | 744 | < 0.1% |
| (Missing) | 1668118 |
Length
| Value | Count | Frequency (%) |
| west | 578302 | |
| east | 576123 | |
| north | 575683 | |
| south | 569823 | |
| unknown | 83866 | 3.3% |
| northeast | 35085 | 1.4% |
| southeast | 33374 | 1.3% |
| southwest | 32658 | 1.3% |
| northwest | 30970 | 1.2% |
| 1003 | < 0.1% | |
| Other values (5) | 744 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2564105 | |
| o | 1361459 | |
| s | 1286512 | |
| h | 1277593 | |
| e | 710389 | 5.9% |
| a | 644582 | 5.3% |
| N | 641915 | 5.3% |
| r | 641738 | 5.3% |
| S | 636061 | 5.2% |
| u | 635855 | 5.2% |
| Other values (7) | 1722613 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12122822 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 2564105 | |
| o | 1361459 | |
| s | 1286512 | |
| h | 1277593 | |
| e | 710389 | 5.9% |
| a | 644582 | 5.3% |
| N | 641915 | 5.3% |
| r | 641738 | 5.3% |
| S | 636061 | 5.2% |
| u | 635855 | 5.2% |
| Other values (7) | 1722613 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12122822 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 2564105 | |
| o | 1361459 | |
| s | 1286512 | |
| h | 1277593 | |
| e | 710389 | 5.9% |
| a | 644582 | 5.3% |
| N | 641915 | 5.3% |
| r | 641738 | 5.3% |
| S | 636061 | 5.2% |
| u | 635855 | 5.2% |
| Other values (7) | 1722613 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12122822 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 2564105 | |
| o | 1361459 | |
| s | 1286512 | |
| h | 1277593 | |
| e | 710389 | 5.9% |
| a | 644582 | 5.3% |
| N | 641915 | 5.3% |
| r | 641738 | 5.3% |
| S | 636061 | 5.2% |
| u | 635855 | 5.2% |
| Other values (7) | 1722613 |
VEHICLE_OCCUPANTS
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 133 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1782928 |
| Missing (%) | 42.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 879.64076 |
| Minimum | 0 |
|---|---|
| Maximum | 1 × 109 |
| Zeros | 412632 |
| Zeros (%) | 9.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 1 × 109 |
| Range | 1 × 109 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 906508.38 |
|---|---|
| Coefficient of variation (CV) | 1030.5439 |
| Kurtosis | 1189444.3 |
| Mean | 879.64076 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1088.2746 |
| Sum | 2.1136193 × 109 |
| Variance | 8.2175744 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1515162 | |
| 0 | 412632 | 9.9% |
| 2 | 318900 | 7.6% |
| 3 | 91605 | 2.2% |
| 4 | 38149 | 0.9% |
| 5 | 13947 | 0.3% |
| 6 | 4584 | 0.1% |
| 7 | 2028 | < 0.1% |
| 8 | 1203 | < 0.1% |
| 9 | 843 | < 0.1% |
| Other values (123) | 3768 | 0.1% |
| (Missing) | 1782928 |
| Value | Count | Frequency (%) |
| 0 | 412632 | 9.9% |
| 1 | 1515162 | |
| 2 | 318900 | 7.6% |
| 3 | 91605 | 2.2% |
| 4 | 38149 | 0.9% |
| 5 | 13947 | 0.3% |
| 6 | 4584 | 0.1% |
| 7 | 2028 | < 0.1% |
| 8 | 1203 | < 0.1% |
| 9 | 843 | < 0.1% |
| Value | Count | Frequency (%) |
| 999999999 | 1 | < 0.1% |
| 981990849 | 1 | < 0.1% |
| 99999999 | 1 | < 0.1% |
| 9999999 | 2 | < 0.1% |
| 5292023 | 1 | < 0.1% |
| 999999 | 3 | < 0.1% |
| 99999 | 3 | < 0.1% |
| 24260 | 1 | < 0.1% |
| 9999 | 16 | |
| 2017 | 2 | < 0.1% |
DRIVER_SEX
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2221537 |
| Missing (%) | 53.1% |
| Memory size | 244.2 MiB |
| M | |
|---|---|
| F | |
| U | 8222 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1964212 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 1453033 | |
| F | 502957 | 12.0% |
| U | 8222 | 0.2% |
| (Missing) | 2221537 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 1453033 | |
| f | 502957 | 25.6% |
| u | 8222 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1453033 | |
| F | 502957 | 25.6% |
| U | 8222 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1964212 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 1453033 | |
| F | 502957 | 25.6% |
| U | 8222 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1964212 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 1453033 | |
| F | 502957 | 25.6% |
| U | 8222 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1964212 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 1453033 | |
| F | 502957 | 25.6% |
| U | 8222 | 0.4% |
DRIVER_LICENSE_STATUS
Categorical
IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2310803 |
| Missing (%) | 55.2% |
| Memory size | 257.3 MiB |
| Licensed | |
|---|---|
| Unlicensed | 37378 |
| Permit | 16688 |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.02207 |
| Min length | 6 |
Characters and Unicode
| Total characters | 15040948 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Licensed |
|---|---|
| 2nd row | Licensed |
| 3rd row | Licensed |
| 4th row | Licensed |
| 5th row | Licensed |
Common Values
| Value | Count | Frequency (%) |
| Licensed | 1820880 | |
| Unlicensed | 37378 | 0.9% |
| Permit | 16688 | 0.4% |
| (Missing) | 2310803 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| licensed | 1820880 | |
| unlicensed | 37378 | 2.0% |
| permit | 16688 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3733204 | |
| n | 1895636 | |
| i | 1874946 | |
| c | 1858258 | |
| s | 1858258 | |
| d | 1858258 | |
| L | 1820880 | |
| U | 37378 | 0.2% |
| l | 37378 | 0.2% |
| P | 16688 | 0.1% |
| Other values (3) | 50064 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15040948 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 3733204 | |
| n | 1895636 | |
| i | 1874946 | |
| c | 1858258 | |
| s | 1858258 | |
| d | 1858258 | |
| L | 1820880 | |
| U | 37378 | 0.2% |
| l | 37378 | 0.2% |
| P | 16688 | 0.1% |
| Other values (3) | 50064 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15040948 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 3733204 | |
| n | 1895636 | |
| i | 1874946 | |
| c | 1858258 | |
| s | 1858258 | |
| d | 1858258 | |
| L | 1820880 | |
| U | 37378 | 0.2% |
| l | 37378 | 0.2% |
| P | 16688 | 0.1% |
| Other values (3) | 50064 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15040948 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 3733204 | |
| n | 1895636 | |
| i | 1874946 | |
| c | 1858258 | |
| s | 1858258 | |
| d | 1858258 | |
| L | 1820880 | |
| U | 37378 | 0.2% |
| l | 37378 | 0.2% |
| P | 16688 | 0.1% |
| Other values (3) | 50064 | 0.3% |
DRIVER_LICENSE_JURISDICTION
Text
MISSING 
| Distinct | 72 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2306176 |
| Missing (%) | 55.1% |
| Memory size | 176.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 2 |
| Mean length | 2.0027288 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3764275 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | FL |
| 3rd row | NY |
| 4th row | NY |
| 5th row | NY |
| Value | Count | Frequency (%) |
| ny | 1619478 | |
| nj | 106707 | 5.7% |
| pa | 33393 | 1.8% |
| ct | 20922 | 1.1% |
| fl | 19940 | 1.1% |
| md | 10792 | 0.6% |
| nc | 6626 | 0.4% |
| ma | 6308 | 0.3% |
| ga | 6221 | 0.3% |
| va | 6180 | 0.3% |
| Other values (61) | 43006 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1739716 | |
| Y | 1619873 | |
| J | 106708 | 2.8% |
| A | 62189 | 1.7% |
| C | 35669 | 0.9% |
| P | 34039 | 0.9% |
| T | 25654 | 0.7% |
| L | 23016 | 0.6% |
| M | 21899 | 0.6% |
| F | 20260 | 0.5% |
| Other values (20) | 75252 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3764275 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 1739716 | |
| Y | 1619873 | |
| J | 106708 | 2.8% |
| A | 62189 | 1.7% |
| C | 35669 | 0.9% |
| P | 34039 | 0.9% |
| T | 25654 | 0.7% |
| L | 23016 | 0.6% |
| M | 21899 | 0.6% |
| F | 20260 | 0.5% |
| Other values (20) | 75252 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3764275 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 1739716 | |
| Y | 1619873 | |
| J | 106708 | 2.8% |
| A | 62189 | 1.7% |
| C | 35669 | 0.9% |
| P | 34039 | 0.9% |
| T | 25654 | 0.7% |
| L | 23016 | 0.6% |
| M | 21899 | 0.6% |
| F | 20260 | 0.5% |
| Other values (20) | 75252 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3764275 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 1739716 | |
| Y | 1619873 | |
| J | 106708 | 2.8% |
| A | 62189 | 1.7% |
| C | 35669 | 0.9% |
| P | 34039 | 0.9% |
| T | 25654 | 0.7% |
| L | 23016 | 0.6% |
| M | 21899 | 0.6% |
| F | 20260 | 0.5% |
| Other values (20) | 75252 | 2.0% |
PRE_CRASH
Categorical
MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 921425 |
| Missing (%) | 22.0% |
| Memory size | 283.4 MiB |
| Going Straight Ahead | |
|---|---|
| Parked | |
| Making Left Turn | |
| Making Right Turn | |
| Stopped in Traffic | 150370 |
| Other values (14) |
Length
| Max length | 26 |
|---|---|
| Median length | 24 |
| Mean length | 15.956158 |
| Min length | 6 |
Characters and Unicode
| Total characters | 52086070 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Going Straight Ahead |
|---|---|
| 2nd row | Going Straight Ahead |
| 3rd row | Parked |
| 4th row | Merging |
| 5th row | Parked |
Common Values
| Value | Count | Frequency (%) |
| Going Straight Ahead | 1598318 | |
| Parked | 561177 | 13.4% |
| Making Left Turn | 201185 | 4.8% |
| Making Right Turn | 165325 | 3.9% |
| Stopped in Traffic | 150370 | 3.6% |
| Slowing or Stopping | 115597 | 2.8% |
| Backing | 111297 | 2.7% |
| Changing Lanes | 95659 | 2.3% |
| Starting from Parking | 53394 | 1.3% |
| Merging | 53182 | 1.3% |
| Other values (9) | 158820 | 3.8% |
| (Missing) | 921425 |
Length
| Value | Count | Frequency (%) |
| going | 1598318 | |
| straight | 1598318 | |
| ahead | 1598318 | |
| parked | 602057 | 7.4% |
| making | 397088 | 4.9% |
| turn | 397088 | 4.9% |
| left | 202187 | 2.5% |
| in | 166924 | 2.1% |
| right | 166218 | 2.0% |
| traffic | 163363 | 2.0% |
| Other values (23) | 1223246 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4868299 | 9.3% |
| 4848801 | 9.3% | |
| a | 4823079 | 9.3% |
| g | 4598754 | 8.8% |
| t | 4085494 | 7.8% |
| n | 3524362 | 6.8% |
| h | 3493515 | 6.7% |
| r | 3180051 | 6.1% |
| e | 2784500 | 5.3% |
| d | 2359762 | 4.5% |
| Other values (28) | 13519453 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 52086070 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 4868299 | 9.3% |
| 4848801 | 9.3% | |
| a | 4823079 | 9.3% |
| g | 4598754 | 8.8% |
| t | 4085494 | 7.8% |
| n | 3524362 | 6.8% |
| h | 3493515 | 6.7% |
| r | 3180051 | 6.1% |
| e | 2784500 | 5.3% |
| d | 2359762 | 4.5% |
| Other values (28) | 13519453 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 52086070 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 4868299 | 9.3% |
| 4848801 | 9.3% | |
| a | 4823079 | 9.3% |
| g | 4598754 | 8.8% |
| t | 4085494 | 7.8% |
| n | 3524362 | 6.8% |
| h | 3493515 | 6.7% |
| r | 3180051 | 6.1% |
| e | 2784500 | 5.3% |
| d | 2359762 | 4.5% |
| Other values (28) | 13519453 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 52086070 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 4868299 | 9.3% |
| 4848801 | 9.3% | |
| a | 4823079 | 9.3% |
| g | 4598754 | 8.8% |
| t | 4085494 | 7.8% |
| n | 3524362 | 6.8% |
| h | 3493515 | 6.7% |
| r | 3180051 | 6.1% |
| e | 2784500 | 5.3% |
| d | 2359762 | 4.5% |
| Other values (28) | 13519453 |
POINT_OF_IMPACT
Categorical
MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1701246 |
| Missing (%) | 40.6% |
| Memory size | 281.0 MiB |
| Center Front End | |
|---|---|
| Left Front Bumper | |
| Center Back End | |
| Right Front Bumper | |
| Right Front Quarter Panel | |
| Other values (14) |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 17.77199 |
| Min length | 4 |
Characters and Unicode
| Total characters | 44154563 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Left Front Bumper |
|---|---|
| 2nd row | Right Front Bumper |
| 3rd row | Left Front Quarter Panel |
| 4th row | Center Front End |
| 5th row | Right Rear Bumper |
Common Values
| Value | Count | Frequency (%) |
| Center Front End | 429564 | 10.3% |
| Left Front Bumper | 314480 | 7.5% |
| Center Back End | 299492 | 7.2% |
| Right Front Bumper | 277632 | 6.6% |
| Right Front Quarter Panel | 177706 | 4.2% |
| Left Front Quarter Panel | 175025 | 4.2% |
| Left Rear Quarter Panel | 142767 | 3.4% |
| Left Side Doors | 131452 | 3.1% |
| Left Rear Bumper | 130652 | 3.1% |
| Right Side Doors | 109876 | 2.6% |
| Other values (9) | 295857 | 7.1% |
| (Missing) | 1701246 |
Length
| Value | Count | Frequency (%) |
| front | 1374407 | |
| left | 894376 | |
| bumper | 809576 | |
| right | 753628 | |
| center | 729056 | |
| end | 729056 | |
| quarter | 597100 | |
| panel | 597100 | |
| rear | 461833 | 5.9% |
| back | 299492 | 3.8% |
| Other values (10) | 643797 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5404918 | ||
| e | 5166730 | |
| r | 4866822 | 11.0% |
| t | 4390163 | 9.9% |
| n | 3432995 | 7.8% |
| a | 2070255 | 4.7% |
| o | 1922154 | 4.4% |
| u | 1408443 | 3.2% |
| F | 1374407 | 3.1% |
| R | 1220430 | 2.8% |
| Other values (24) | 12897246 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 44154563 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5404918 | ||
| e | 5166730 | |
| r | 4866822 | 11.0% |
| t | 4390163 | 9.9% |
| n | 3432995 | 7.8% |
| a | 2070255 | 4.7% |
| o | 1922154 | 4.4% |
| u | 1408443 | 3.2% |
| F | 1374407 | 3.1% |
| R | 1220430 | 2.8% |
| Other values (24) | 12897246 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 44154563 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5404918 | ||
| e | 5166730 | |
| r | 4866822 | 11.0% |
| t | 4390163 | 9.9% |
| n | 3432995 | 7.8% |
| a | 2070255 | 4.7% |
| o | 1922154 | 4.4% |
| u | 1408443 | 3.2% |
| F | 1374407 | 3.1% |
| R | 1220430 | 2.8% |
| Other values (24) | 12897246 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 44154563 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5404918 | ||
| e | 5166730 | |
| r | 4866822 | 11.0% |
| t | 4390163 | 9.9% |
| n | 3432995 | 7.8% |
| a | 2070255 | 4.7% |
| o | 1922154 | 4.4% |
| u | 1408443 | 3.2% |
| F | 1374407 | 3.1% |
| R | 1220430 | 2.8% |
| Other values (24) | 12897246 |
VEHICLE_DAMAGE
Categorical
MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1725730 |
| Missing (%) | 41.2% |
| Memory size | 279.2 MiB |
| Center Front End | |
|---|---|
| Left Front Bumper | |
| Center Back End | |
| Right Front Bumper | |
| No Damage | |
| Other values (14) |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 17.114411 |
| Min length | 4 |
Characters and Unicode
| Total characters | 42101776 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Left Front Quarter Panel |
|---|---|
| 2nd row | Right Front Bumper |
| 3rd row | Left Front Quarter Panel |
| 4th row | Center Front End |
| 5th row | Right Rear Bumper |
Common Values
| Value | Count | Frequency (%) |
| Center Front End | 382597 | 9.1% |
| Left Front Bumper | 259182 | 6.2% |
| Center Back End | 255987 | 6.1% |
| Right Front Bumper | 238256 | 5.7% |
| No Damage | 232503 | 5.6% |
| Left Front Quarter Panel | 171577 | 4.1% |
| Right Front Quarter Panel | 166860 | 4.0% |
| Left Rear Quarter Panel | 136235 | 3.3% |
| Left Side Doors | 135595 | 3.2% |
| Left Rear Bumper | 125079 | 3.0% |
| Other values (9) | 356148 | 8.5% |
| (Missing) | 1725730 |
Length
| Value | Count | Frequency (%) |
| front | 1218472 | |
| left | 827668 | |
| bumper | 709483 | |
| right | 698601 | |
| center | 638584 | |
| end | 638584 | |
| quarter | 568019 | |
| panel | 568019 | |
| rear | 441627 | 5.8% |
| back | 255987 | 3.4% |
| Other values (10) | 1025203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5130228 | ||
| e | 4940087 | 11.7% |
| r | 4456135 | 10.6% |
| t | 4000089 | 9.5% |
| n | 3068608 | 7.3% |
| a | 2305580 | 5.5% |
| o | 1962673 | 4.7% |
| u | 1280264 | 3.0% |
| F | 1218472 | 2.9% |
| R | 1145209 | 2.7% |
| Other values (24) | 12594431 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42101776 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5130228 | ||
| e | 4940087 | 11.7% |
| r | 4456135 | 10.6% |
| t | 4000089 | 9.5% |
| n | 3068608 | 7.3% |
| a | 2305580 | 5.5% |
| o | 1962673 | 4.7% |
| u | 1280264 | 3.0% |
| F | 1218472 | 2.9% |
| R | 1145209 | 2.7% |
| Other values (24) | 12594431 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42101776 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5130228 | ||
| e | 4940087 | 11.7% |
| r | 4456135 | 10.6% |
| t | 4000089 | 9.5% |
| n | 3068608 | 7.3% |
| a | 2305580 | 5.5% |
| o | 1962673 | 4.7% |
| u | 1280264 | 3.0% |
| F | 1218472 | 2.9% |
| R | 1145209 | 2.7% |
| Other values (24) | 12594431 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42101776 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5130228 | ||
| e | 4940087 | 11.7% |
| r | 4456135 | 10.6% |
| t | 4000089 | 9.5% |
| n | 3068608 | 7.3% |
| a | 2305580 | 5.5% |
| o | 1962673 | 4.7% |
| u | 1280264 | 3.0% |
| F | 1218472 | 2.9% |
| R | 1145209 | 2.7% |
| Other values (24) | 12594431 |
VEHICLE_DAMAGE_1
Categorical
MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2601039 |
| Missing (%) | 62.1% |
| Memory size | 268.7 MiB |
| No Damage | |
|---|---|
| Left Front Bumper | |
| Center Front End | |
| Right Front Bumper | |
| Left Front Quarter Panel | |
| Other values (14) |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 15.71644 |
| Min length | 4 |
Characters and Unicode
| Total characters | 24906000 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Right Front Quarter Panel |
|---|---|
| 2nd row | No Damage |
| 3rd row | Center Back End |
| 4th row | Left Rear Quarter Panel |
| 5th row | Right Front Quarter Panel |
Common Values
| Value | Count | Frequency (%) |
| No Damage | 437596 | 10.5% |
| Left Front Bumper | 158265 | 3.8% |
| Center Front End | 149772 | 3.6% |
| Right Front Bumper | 126608 | 3.0% |
| Left Front Quarter Panel | 100756 | 2.4% |
| Right Front Quarter Panel | 92310 | 2.2% |
| Left Rear Bumper | 83214 | 2.0% |
| Right Rear Bumper | 77745 | 1.9% |
| Left Rear Quarter Panel | 71720 | 1.7% |
| Left Side Doors | 71277 | 1.7% |
| Other values (9) | 215447 | 5.1% |
| (Missing) | 2601039 |
Length
| Value | Count | Frequency (%) |
| front | 627711 | |
| left | 485232 | |
| bumper | 445832 | |
| no | 437596 | |
| damage | 437596 | |
| right | 415584 | |
| quarter | 320917 | |
| panel | 320917 | |
| rear | 288810 | |
| end | 212985 | 4.7% |
| Other values (10) | 577645 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2986115 | ||
| e | 2897089 | 11.6% |
| r | 2386084 | 9.6% |
| t | 2088300 | 8.4% |
| a | 1874100 | 7.5% |
| n | 1378134 | 5.5% |
| o | 1340076 | 5.4% |
| m | 886239 | 3.6% |
| g | 855512 | 3.4% |
| u | 767953 | 3.1% |
| Other values (24) | 7446398 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 24906000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2986115 | ||
| e | 2897089 | 11.6% |
| r | 2386084 | 9.6% |
| t | 2088300 | 8.4% |
| a | 1874100 | 7.5% |
| n | 1378134 | 5.5% |
| o | 1340076 | 5.4% |
| m | 886239 | 3.6% |
| g | 855512 | 3.4% |
| u | 767953 | 3.1% |
| Other values (24) | 7446398 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 24906000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2986115 | ||
| e | 2897089 | 11.6% |
| r | 2386084 | 9.6% |
| t | 2088300 | 8.4% |
| a | 1874100 | 7.5% |
| n | 1378134 | 5.5% |
| o | 1340076 | 5.4% |
| m | 886239 | 3.6% |
| g | 855512 | 3.4% |
| u | 767953 | 3.1% |
| Other values (24) | 7446398 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 24906000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2986115 | ||
| e | 2897089 | 11.6% |
| r | 2386084 | 9.6% |
| t | 2088300 | 8.4% |
| a | 1874100 | 7.5% |
| n | 1378134 | 5.5% |
| o | 1340076 | 5.4% |
| m | 886239 | 3.6% |
| g | 855512 | 3.4% |
| u | 767953 | 3.1% |
| Other values (24) | 7446398 |
VEHICLE_DAMAGE_2
Categorical
MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2991845 |
| Missing (%) | 71.5% |
| Memory size | 263.1 MiB |
| No Damage | |
|---|---|
| Right Front Bumper | |
| Left Front Bumper | |
| Center Front End | |
| Left Rear Bumper | |
| Other values (14) |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 13.734143 |
| Min length | 4 |
Characters and Unicode
| Total characters | 16397248 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Damage |
|---|---|
| 2nd row | Left Rear Bumper |
| 3rd row | Right Front Bumper |
| 4th row | No Damage |
| 5th row | No Damage |
Common Values
| Value | Count | Frequency (%) |
| No Damage | 563895 | 13.5% |
| Right Front Bumper | 119786 | 2.9% |
| Left Front Bumper | 71209 | 1.7% |
| Center Front End | 59219 | 1.4% |
| Left Rear Bumper | 58982 | 1.4% |
| Left Front Quarter Panel | 44918 | 1.1% |
| Right Rear Bumper | 42469 | 1.0% |
| Right Front Quarter Panel | 39256 | 0.9% |
| Left Rear Quarter Panel | 37599 | 0.9% |
| Right Rear Quarter Panel | 37364 | 0.9% |
| Other values (9) | 119207 | 2.8% |
| (Missing) | 2991845 |
Length
| Value | Count | Frequency (%) |
| no | 563895 | |
| damage | 563895 | |
| front | 334388 | |
| bumper | 292446 | |
| right | 264548 | |
| left | 242613 | |
| rear | 176414 | 5.7% |
| quarter | 159137 | 5.1% |
| panel | 159137 | 5.1% |
| end | 90685 | 2.9% |
| Other values (10) | 265470 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1918724 | ||
| e | 1866194 | |
| a | 1658426 | 10.1% |
| r | 1302128 | 7.9% |
| t | 1118109 | 6.8% |
| o | 1013831 | 6.2% |
| m | 858137 | 5.2% |
| g | 830594 | 5.1% |
| n | 677838 | 4.1% |
| D | 621269 | 3.8% |
| Other values (24) | 4531998 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16397248 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1918724 | ||
| e | 1866194 | |
| a | 1658426 | 10.1% |
| r | 1302128 | 7.9% |
| t | 1118109 | 6.8% |
| o | 1013831 | 6.2% |
| m | 858137 | 5.2% |
| g | 830594 | 5.1% |
| n | 677838 | 4.1% |
| D | 621269 | 3.8% |
| Other values (24) | 4531998 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16397248 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1918724 | ||
| e | 1866194 | |
| a | 1658426 | 10.1% |
| r | 1302128 | 7.9% |
| t | 1118109 | 6.8% |
| o | 1013831 | 6.2% |
| m | 858137 | 5.2% |
| g | 830594 | 5.1% |
| n | 677838 | 4.1% |
| D | 621269 | 3.8% |
| Other values (24) | 4531998 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16397248 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1918724 | ||
| e | 1866194 | |
| a | 1658426 | 10.1% |
| r | 1302128 | 7.9% |
| t | 1118109 | 6.8% |
| o | 1013831 | 6.2% |
| m | 858137 | 5.2% |
| g | 830594 | 5.1% |
| n | 677838 | 4.1% |
| D | 621269 | 3.8% |
| Other values (24) | 4531998 |
VEHICLE_DAMAGE_3
Categorical
IMBALANCE  MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3270248 |
| Missing (%) | 78.1% |
| Memory size | 259.3 MiB |
| No Damage | |
|---|---|
| Center Front End | 31656 |
| Other | 31361 |
| Right Front Bumper | 26206 |
| Left Front Bumper | 25271 |
| Other values (14) |
Length
| Max length | 25 |
|---|---|
| Median length | 9 |
| Mean length | 11.32324 |
| Min length | 4 |
Characters and Unicode
| Total characters | 10366438 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Damage |
|---|---|
| 2nd row | No Damage |
| 3rd row | No Damage |
| 4th row | No Damage |
| 5th row | No Damage |
Common Values
| Value | Count | Frequency (%) |
| No Damage | 650342 | 15.5% |
| Center Front End | 31656 | 0.8% |
| Other | 31361 | 0.7% |
| Right Front Bumper | 26206 | 0.6% |
| Left Front Bumper | 25271 | 0.6% |
| Left Front Quarter Panel | 24371 | 0.6% |
| Right Front Quarter Panel | 21325 | 0.5% |
| Center Back End | 17508 | 0.4% |
| Left Rear Quarter Panel | 15533 | 0.4% |
| Left Rear Bumper | 15177 | 0.4% |
| Other values (9) | 56751 | 1.4% |
| (Missing) | 3270248 |
Length
| Value | Count | Frequency (%) |
| no | 650342 | |
| damage | 650342 | |
| front | 128829 | 6.2% |
| left | 92468 | 4.4% |
| right | 83620 | 4.0% |
| bumper | 78476 | 3.8% |
| quarter | 74879 | 3.6% |
| panel | 74879 | 3.6% |
| rear | 56182 | 2.7% |
| end | 49164 | 2.4% |
| Other values (10) | 152045 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1531471 | |
| e | 1193939 | |
| 1175725 | ||
| o | 829832 | |
| g | 737549 | 7.1% |
| m | 731377 | 7.1% |
| D | 675634 | 6.5% |
| N | 650342 | 6.3% |
| r | 529428 | 5.1% |
| t | 461238 | 4.4% |
| Other values (24) | 1849903 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10366438 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1531471 | |
| e | 1193939 | |
| 1175725 | ||
| o | 829832 | |
| g | 737549 | 7.1% |
| m | 731377 | 7.1% |
| D | 675634 | 6.5% |
| N | 650342 | 6.3% |
| r | 529428 | 5.1% |
| t | 461238 | 4.4% |
| Other values (24) | 1849903 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10366438 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1531471 | |
| e | 1193939 | |
| 1175725 | ||
| o | 829832 | |
| g | 737549 | 7.1% |
| m | 731377 | 7.1% |
| D | 675634 | 6.5% |
| N | 650342 | 6.3% |
| r | 529428 | 5.1% |
| t | 461238 | 4.4% |
| Other values (24) | 1849903 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10366438 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1531471 | |
| e | 1193939 | |
| 1175725 | ||
| o | 829832 | |
| g | 737549 | 7.1% |
| m | 731377 | 7.1% |
| D | 675634 | 6.5% |
| N | 650342 | 6.3% |
| r | 529428 | 5.1% |
| t | 461238 | 4.4% |
| Other values (24) | 1849903 |
PUBLIC_PROPERTY_DAMAGE
Categorical
IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1528858 |
| Missing (%) | 36.5% |
| Memory size | 243.3 MiB |
| N | |
|---|---|
| Unspecified | |
| Y | 15192 |
Length
| Max length | 11 |
|---|---|
| Median length | 1 |
| Mean length | 2.1968011 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5836661 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N |
|---|---|
| 2nd row | N |
| 3rd row | N |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 2323722 | |
| Unspecified | 317977 | 7.6% |
| Y | 15192 | 0.4% |
| (Missing) | 1528858 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| n | 2323722 | |
| unspecified | 317977 | 12.0% |
| y | 15192 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2323722 | |
| e | 635954 | 10.9% |
| i | 635954 | 10.9% |
| U | 317977 | 5.4% |
| n | 317977 | 5.4% |
| s | 317977 | 5.4% |
| p | 317977 | 5.4% |
| c | 317977 | 5.4% |
| f | 317977 | 5.4% |
| d | 317977 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5836661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 2323722 | |
| e | 635954 | 10.9% |
| i | 635954 | 10.9% |
| U | 317977 | 5.4% |
| n | 317977 | 5.4% |
| s | 317977 | 5.4% |
| p | 317977 | 5.4% |
| c | 317977 | 5.4% |
| f | 317977 | 5.4% |
| d | 317977 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5836661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 2323722 | |
| e | 635954 | 10.9% |
| i | 635954 | 10.9% |
| U | 317977 | 5.4% |
| n | 317977 | 5.4% |
| s | 317977 | 5.4% |
| p | 317977 | 5.4% |
| c | 317977 | 5.4% |
| f | 317977 | 5.4% |
| d | 317977 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5836661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 2323722 | |
| e | 635954 | 10.9% |
| i | 635954 | 10.9% |
| U | 317977 | 5.4% |
| n | 317977 | 5.4% |
| s | 317977 | 5.4% |
| p | 317977 | 5.4% |
| c | 317977 | 5.4% |
| f | 317977 | 5.4% |
| d | 317977 | 5.4% |
PUBLIC_PROPERTY_DAMAGE_TYPE
Text
MISSING 
| Distinct | 19198 |
|---|---|
| Distinct (%) | 73.2% |
| Missing | 4159532 |
| Missing (%) | 99.4% |
| Memory size | 129.3 MiB |
Length
| Max length | 866 |
|---|---|
| Median length | 383 |
| Mean length | 38.39852 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1006694 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18215 ? |
|---|---|
| Unique (%) | 69.5% |
Sample
| 1st row | UTILITY POLE |
|---|---|
| 2nd row | PASSENGER FRONT SIDE DAMAGED |
| 3rd row | FENCE OF A SCHOOL IN THE BACK |
| 4th row | POWERLINE CABLES IN FRONT OF 4236 BEDFORD AVENUE |
| 5th row | BRICK FENCE WAS STRUCK BY MV1 WHEN TRYING TO PARK. |
| Value | Count | Frequency (%) |
| fence | 6179 | 3.6% |
| of | 5954 | 3.4% |
| and | 4795 | 2.8% |
| to | 4379 | 2.5% |
| the | 3906 | 2.2% |
| pole | 3753 | 2.2% |
| damage | 3383 | 1.9% |
| front | 2954 | 1.7% |
| vehicle | 2698 | 1.6% |
| light | 2477 | 1.4% |
| Other values (11801) | 133124 |
Most occurring characters
| Value | Count | Frequency (%) |
| 147385 | ||
| E | 98251 | 9.8% |
| A | 68999 | 6.9% |
| T | 65545 | 6.5% |
| O | 62299 | 6.2% |
| N | 61591 | 6.1% |
| I | 55333 | 5.5% |
| R | 52986 | 5.3% |
| D | 42379 | 4.2% |
| L | 40232 | 4.0% |
| Other values (51) | 311694 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1006694 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 147385 | ||
| E | 98251 | 9.8% |
| A | 68999 | 6.9% |
| T | 65545 | 6.5% |
| O | 62299 | 6.2% |
| N | 61591 | 6.1% |
| I | 55333 | 5.5% |
| R | 52986 | 5.3% |
| D | 42379 | 4.2% |
| L | 40232 | 4.0% |
| Other values (51) | 311694 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1006694 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 147385 | ||
| E | 98251 | 9.8% |
| A | 68999 | 6.9% |
| T | 65545 | 6.5% |
| O | 62299 | 6.2% |
| N | 61591 | 6.1% |
| I | 55333 | 5.5% |
| R | 52986 | 5.3% |
| D | 42379 | 4.2% |
| L | 40232 | 4.0% |
| Other values (51) | 311694 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1006694 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 147385 | ||
| E | 98251 | 9.8% |
| A | 68999 | 6.9% |
| T | 65545 | 6.5% |
| O | 62299 | 6.2% |
| N | 61591 | 6.1% |
| I | 55333 | 5.5% |
| R | 52986 | 5.3% |
| D | 42379 | 4.2% |
| L | 40232 | 4.0% |
| Other values (51) | 311694 |
MISSING 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 148303 |
| Missing (%) | 3.5% |
| Memory size | 286.8 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 16.314718 |
| Min length | 1 |
Characters and Unicode
| Total characters | 65869792 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Driver Inattention/Distraction |
| 3rd row | Driver Inattention/Distraction |
| 4th row | Unspecified |
| 5th row | Other Vehicular |
| Value | Count | Frequency (%) |
| unspecified | 2372696 | |
| driver | 554052 | 8.5% |
| inattention/distraction | 514304 | 7.9% |
| too | 193911 | 3.0% |
| closely | 193911 | 3.0% |
| to | 171857 | 2.6% |
| failure | 149257 | 2.3% |
| yield | 142195 | 2.2% |
| right-of-way | 142195 | 2.2% |
| following | 133086 | 2.0% |
| Other values (96) | 1955174 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8576868 | |
| e | 8049142 | 12.2% |
| n | 5785046 | 8.8% |
| s | 4067067 | 6.2% |
| t | 3458998 | 5.3% |
| c | 3426454 | 5.2% |
| r | 2949697 | 4.5% |
| o | 2880880 | 4.4% |
| d | 2852972 | 4.3% |
| f | 2800665 | 4.3% |
| Other values (45) | 21022003 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 65869792 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 8576868 | |
| e | 8049142 | 12.2% |
| n | 5785046 | 8.8% |
| s | 4067067 | 6.2% |
| t | 3458998 | 5.3% |
| c | 3426454 | 5.2% |
| r | 2949697 | 4.5% |
| o | 2880880 | 4.4% |
| d | 2852972 | 4.3% |
| f | 2800665 | 4.3% |
| Other values (45) | 21022003 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 65869792 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 8576868 | |
| e | 8049142 | 12.2% |
| n | 5785046 | 8.8% |
| s | 4067067 | 6.2% |
| t | 3458998 | 5.3% |
| c | 3426454 | 5.2% |
| r | 2949697 | 4.5% |
| o | 2880880 | 4.4% |
| d | 2852972 | 4.3% |
| f | 2800665 | 4.3% |
| Other values (45) | 21022003 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 65869792 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 8576868 | |
| e | 8049142 | 12.2% |
| n | 5785046 | 8.8% |
| s | 4067067 | 6.2% |
| t | 3458998 | 5.3% |
| c | 3426454 | 5.2% |
| r | 2949697 | 4.5% |
| o | 2880880 | 4.4% |
| d | 2852972 | 4.3% |
| f | 2800665 | 4.3% |
| Other values (45) | 21022003 |
MISSING 
| Distinct | 56 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1688054 |
| Missing (%) | 40.3% |
| Memory size | 220.0 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.748036 |
| Min length | 1 |
Characters and Unicode
| Total characters | 34338402 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unsafe Lane Changing |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 1958940 | |
| driver | 171806 | 5.1% |
| inattention/distraction | 139153 | 4.1% |
| too | 85262 | 2.5% |
| closely | 85262 | 2.5% |
| lane | 61730 | 1.8% |
| passing | 59541 | 1.8% |
| following | 58590 | 1.7% |
| unsafe | 55755 | 1.6% |
| to | 51168 | 1.5% |
| Other values (94) | 654367 | 19.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5109609 | |
| i | 5058714 | |
| n | 3051663 | |
| s | 2501830 | 7.3% |
| c | 2247984 | 6.5% |
| p | 2148389 | 6.3% |
| d | 2118406 | 6.2% |
| f | 2115093 | 6.2% |
| U | 2068417 | 6.0% |
| r | 938728 | 2.7% |
| Other values (43) | 6979569 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34338402 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5109609 | |
| i | 5058714 | |
| n | 3051663 | |
| s | 2501830 | 7.3% |
| c | 2247984 | 6.5% |
| p | 2148389 | 6.3% |
| d | 2118406 | 6.2% |
| f | 2115093 | 6.2% |
| U | 2068417 | 6.0% |
| r | 938728 | 2.7% |
| Other values (43) | 6979569 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34338402 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5109609 | |
| i | 5058714 | |
| n | 3051663 | |
| s | 2501830 | 7.3% |
| c | 2247984 | 6.5% |
| p | 2148389 | 6.3% |
| d | 2118406 | 6.2% |
| f | 2115093 | 6.2% |
| U | 2068417 | 6.0% |
| r | 938728 | 2.7% |
| Other values (43) | 6979569 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34338402 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5109609 | |
| i | 5058714 | |
| n | 3051663 | |
| s | 2501830 | 7.3% |
| c | 2247984 | 6.5% |
| p | 2148389 | 6.3% |
| d | 2118406 | 6.2% |
| f | 2115093 | 6.2% |
| U | 2068417 | 6.0% |
| r | 938728 | 2.7% |
| Other values (43) | 6979569 |